Auditory Cortical Representations of Speech Signals for Phoneme Classification

نویسندگان

  • Hugo Leonardo Rufiner
  • César E. Martínez
  • Diego H. Milone
  • John C. Goddard
چکیده

The use of biologically inspired, feature extraction methods has improved the performance of artificial systems that try to emulate some aspect of human communication. Recent techniques, such as independent component analysis and sparse representations, have made it possible to undertake speech signal analysis using features similar to the ones found experimentally at the primary auditory cortex level. In this work, a new type of speech signal representation, based on the spectrotemporal receptive fields, is presented, and a problem of phoneme classification is tackled for the first time using this representation. The results obtained are compared, and found to greatly improve both an early auditory representation and the classical front-end based on Mel frequency cepstral coefficients.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain

This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...

متن کامل

Early-latency categorical speech sound representations in the left inferior frontal gyrus

Efficient speech perception requires the mapping of highly variable acoustic signals to distinct phonetic categories. How the brain overcomes this many-to-one mapping problem has remained unresolved. To infer the cortical location, latency, and dependency on attention of categorical speech sound representations in the human brain, we measured stimulus-specific adaptation of neuromagnetic respon...

متن کامل

Learning sound categories: A neural model and supporting experiments

Our ability to discriminate sounds such as vowels is not uniform throughout acoustic space. That is, our auditory perceptual spaces are warped representations of acoustic space. One example of auditory space warping, the perceptual magnet effect, arises from exposure to the phonemes of an infant’s native language. We have developed a neural model that accounts for this effect. The model is base...

متن کامل

ning Sound Categories: A Neural Model and Supporting Experiments

Our ability to discriminate sounds such as vowels is not uniform throughout acoustic space. That is, our auditory perceptual spaces are warped representations of acoustic space. One example of auditory space warping, the perceptual magnet effect, arises from exposure to the phonemes of an infant’s native language. We have developed a neural model that accounts for this effect. The model is base...

متن کامل

Pii: S0304-3940(99)00336-5

Hemispheric asymmetries in response to speech sounds are well documented. However, it is not known if these asymmetries re ̄ect only cortical hemispheric specialization to language or whether they also re ̄ect pre-conscious encoding of signals at lower levels of the auditory pathway. This study examined differences in neural representations of signals with acoustic properties inherent to speech i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007